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ABSTRACT 

A bisection of a graph with an even number of vertices is a partition of the 
vertex set into two disjoint sets of equal size. Given a bisection, the number of edges 
having one end in each of the two subsets of the bisection is called the size of the 
bisection. The bisection size of a graph is the minimum size of all possible bisections 
of the graph. Given a graph with an even number of vertices and a positive integer, 
the graph bisection problem is the problem of determining if the bisection size of 
the graph is less than the given number. The graph bisection problem is known to 
beNP-hard. 

In this thesis, we give probabilistic lower bounds and upper bounds for the 
bisection size of random graphs, graphs in which an edge appears between any two 
vertices with a certain fixed probability, say p, independent of all other edges. In 
particular, we show that, with probability 1, the bisection size of random graphs 

on 2n vertices is greater than or equal to n 2 p — 0(n z / 2 yjp{l — p) j and is less than 

n 2 p — Olnyp(l — p) J. Upper bound and lower bound on the bisection size are 

given in the case p is a function of n, specifically when p = p(n) = £. We also 
consider some heuristics for solving the graph bisection problem. 
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Chapter 1 
Introduction 



In this thesis, we study the graph bisection problem. We shall consider only 
undirected, simple graphs having unit cost on all of its edges, and an even number 
of vertices. We consider the problem of partitioning the vertex set into two disjoint 
sets of equal size which minimize the number of cut edges (edges having one 
endpoint in each subset of the partition). This problem is called the graph bisection 
problem 1 . 

By relaxing some constraints in the graph bisection problem we obtain the 
graph partitioning problem. Specifically, let an undirected, simple graph with costs 
on its edges be given. The graph partitioning problem is the problem of partitioning 
the set of vertices into disjoint subsets, each having cardinality smaller than a 
given fixed number, so as to minimize the total cost of the edges having ends in 
different subsets of the partition. Even though the graph bisection problem seems 
simpler than the graph partitioning problem, it still retains the important feature 
of the latter. Furthermore, an algorithm for solving the graph bisection problem 
can be adapted as a heuristic to approximately solve the general graph partitioning 
problem [KL70]. 

Aside from being of theoretical interest, the graph partitioning problem serves as 
an abstraction for various practical problems. An example is the problem of placing 
components of a circuit on printed circuit boards. The objective here is to perform 



'For simplicity we use this definition here. The graph bisection problem will be defined more 
carefully in the next chapter as a decision problem. 
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the placement so as to minimize the number of connections between components 
on different boards, for these connections are slower and more expensive than those 
connecting components on the same board. This problem can be represented as 
a graph whose vertices depicting the components and the edges representing the 
connections of the circuit. The maximum cardinality of each subset of the partition 
corresponds to the maximum number of components each board can have. 

Another example arises in the management of a page or segmentation memory- 
structure of a computer system. The problem here is to assign components (e.g. 
subroutines or procedures) of a large program into different fixed-size pages of the 
memory so that the number of references between components on different pages 
of the memory is minimized. 

In recent years, the graph partitioning problem has been used as a model for 
an important problem in the Very Large Scale Integration (VLSI) design process. 
This problem arises in the placement phase of the design process, in which tens or 
hundreds of thousands of components must be placed on a wafer subject to certain 
constraints. With such a large number of components to be arranged, it is desirable 
to have this process automated. At the present, most models for the placement 
problem are of the form of a graph partitioning problem [Br-77]. 

It is known that the graph partitioning problem (when phrased as a decision 
problem) is NP-complete [HR]. That means at the present no algorithm is known 
to solve the graph partitioning problem in time polynomial in the length of the 
input (assuming a reasonable encoding of the problem). Nonetheless, there are 
approximate algorithms, or heuristics to solve this problem. Hence, it is useful to 
know how good a solution a given heuristic can provide, without having to use 
empirical methods. It should be noted that if we consider the graph partitioning 
problem in which each subsets of the partition has size less than or equal to 2 then 
this is the problem of finding maximum matching, also if the size of each subset 
is restricted to be less than or equal to n — 1 where n is the number of vertices in 
the graph then this becomes the problem of finding a minimal cut set of the given 
graph. There exist polynomial-time algorithms for both of these problems. 

It can be easily shown that the graph bisection problem (when phrased as a 
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decision problem) is also NP-complete. There are several heuristics often used for 
this problem, all of which have the form of a hill-climbing search or some variation 
of it. The analysis of the performance of a heuristic would be facilitated if we have 
some information about the objects the heuristic is applied on. In other words, 
we would like the input to the heuristic to be in some structured class of objects. 
To this end we choose random graphs as input to the heuristics. By a random 
graph we mean a graph in which an edge appears between any two vertices with 
a certain probability, possibly dependent on the number of vertices of the graph, 
but independent of all other edges. The performance of the heuristic on this class 
of graphs may give insight to the typical behaviour of the heuristic in practice. 

To determine the performance of a heuristic, we must first know what kind 
of solution we expect to have, that is we must know the size of the optimal cut 
edge set of a random graph. Then by comparing this with the solution given by 
the heuristic we can determine how good the heuristic is. 

The thesis is divided as follows. In Chapter 2 we review some standard graph 
theoretic notions, define the graph bisection problem, introduce models for random 
graphs, and present some known results. In Chapter 3 we give probabilistic lower 
bound and upper bound for the bisection size of random graphs. We review 
some known algorithms for solving the graph bisection problem and present some 
empirical data on the performance of some heuristics on random graphs in Chapter 
4. Finally, Chapter 5 will provide a summary and some open questions. 
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Chapter 2 
The Graph Bisection Problem 



In this chapter we review the known results about the bisection size of graphs. 
To facilitate this/purpose and to lay the ground for later work we first present 
some standajxTgraph notions. We also give the definitions for our models of random 
graphs. The graph bisection problem is also defined formally. 

2.1. Some Graph Theoretic Definitions 

An (undirected) graph G = (V, E) consists of a set V of vertices and a set E 
of unordered pairs of vertices, called edges (we use the notation (a, b) to denote the 
unordered pair a and 6). We consider only simple graphs, that is graphs satisfying 
the condition that, for all v £ V, (v, v) £ E and there is at most one edge between 
any two vertices in V. Let |A| denote the cardinality of the set A. Let G — (V, E) 
be a graph, for each v £ V, the degree of v, denoted by deg(v), is defined as 

deg(v) = \{v)eV\(v,w)e E}\. (2-1) 

Let AC.V, the subgraph of G induced by A is the graph obtained from G 
by deleting all vertices in V — A and all edges in E incident to those vertices in 
V — A. We call G a complete graph if \E\ = ( ' 2 )• A sequence of vertices v\, . . ., v m 
is called a path between vi and v m in G if for all i £ { 1, . . ., m — 1 }, (vi, Ui-fi) £ E. 
If v\ = v m then the path is called a cycle. G is said to be connected if there is a 
path between any two vertices in V. A tree is a connected graph having no cycle. 
If we label the vertices in V with the integers in { 1, 2, . . ., | V\ } so that each vertex 
has a distinct label, then the adjacency matrix A(G) of G, a square matrix of order 
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|V|, whose element in the zth row and the jth column is defined as follows. 

' 10, it{i,j)iE. ^ ' 

Now suppose |V| = 2n and let A,B C V. The pair (A, B) is called a bisection 
of G if A fl B = 0, A U £ = V, and |A| = |B| = n. Let (A, B) be a bisection of G. 
For each o£A define 

m [A,B)[ a ) = degree of a in the subgraph of G induced by A. (2-3) 

ex (A,B)(°) — degree of o in the subgraph ofG induced by {a} (J B. (2-4) 

Similar definitions apply for each b £ B. We shall omit the subscript (A, jB) and 
just write in(a) and ex(o) when there is no confusion. We now define the size of 
(A, B) written | {A, B) | as 

|(A,fl)|=£ex(a) (2-5) 

a£A 

= £>(&) (2-6) 

6€B 

H£|-5(£in(a)+£in(ft)Y (2 _ 7) 

The bisection size of G is defined as 

C(G) = min{ |(A, S)| | (A, 5) is a bisection of G }. (2-8) 

The graph bisection problem can now be stated. 

Instance : G = {V,E), \V\ = 2n, < k < \B\. 

Question : Is C(G) < k ? 

In this thesis we shall deal mainly with a special kind of graphs, namely the 
random graphs. Random graphs were first considered by Erdbs [E59], [E61], and 
were studied in some details by Erdbs and Renyi in [ER59], [ER60]. Random 
graphs have been used in nonconstructive proofs of combinatorial theorems, see, for 
example, the book by Erdbs and Spencer [ES74]. Random graphs have also been 
studied for their own interest and many results concerning random graphs have 
been discovered [BES80, Bo81a, Bo81b, Bo82, BE76, GM75, Mat76, P76]. For an 
extensive bibliography on the literature of random graphs see [Ka82]. The two most 
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often used models of random graphs are closely related. We start with a fixed set of 
n distinct (labeled) vertices. In the first model, we choose each of the \J\ possible 
edges with a fixed probability p, < p < 1, independent of the choices of all other 
edges. In the second model, we take all graphs on n vertices and m edges, where m 
may be dependent on n, and consider them as points of a probability space, having 
equal probability. In the first model we write Q(n, p) to denote the probability space 
of all graphs with a fixed set of n labeled vertices and the probability of a graph 
with m edges is p m qw~ m , where q = 1 — p. In the second model we use Q[n, m) 
to denote the probability space containing all graphs on n vertices with m edges 
< m < \j), and each graph in Q{n, m) has equal probability. Another commonly 
used model of random graphs is the model 5{n,p) with p being a function from I s ? 
to (0, 1). It is useful to define the following concept which will be needed later. Let 
Q be a property of graphs in ${n,p) or Q{n,m). We say almost every (a.e.) graph 
in Q{n,p) has property Q if Pr{ G n , P G 9i n >P) I Gn,p has Q }-> 1 as n->oo. Similar 
definition applies for graphs in Q{n, m). Unless otherwise indicated, all logarithms 
in this thesis are natural logarithms. 

2.2. NP-completeness of the Graph Bisection Problem 

The NP-complete problems are the hardest problems in a class of problems, 
called NP, that can be solved in polynomial time by nondeterministic Turing 
machines. The corresponding class of probems which can be solved in polynomial 
time by deterministic Turing machines is called P. One of the most important open 
questions in theoretical computer science is whether P is equal to NP. This means 
that we do not now know any deterministic, polynomial time algorithms for solving 
an NP-complete problem. (Note that P C NP and the general consensus is that 
P is not equal to NP.) For more precise definition of NP-completeness see [GJ79]. 
The graph bisection problem can be easily shown to be NP-complete by reducing 
it to the simple max cut problem, which is known to be NP-complete [GJS76]. The 
simple max cut problem is the following problem. 

Instance : Graph G = (V, E), positive integer K. 

Question : Is there a partition of V into 2 disjoint sets such that the 
number of edges having one end in each of the two sets is at least K? 
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The proof of the following proposition is almost identical to the one given in 
[GJS76] for proving the NP-completeness of a problem slightly different from the 
graph bisection problem. 

Proposition 2.1. The graph bisection problem is NP-complete. 

Proof : We reduce the simple max cut problem to the graph bisection problem. 
Let G = (V, E) and K £ N be an instance of the simple max cut problem. Let 
|V| = n, and U = { u\, . . ., u n } be a set of new vertices. We construct an instance 
for the graph bisection problem as follows. Construct G' = (V, E') such that 

V = v u U, 

E J = {(u,v)\u,veV,(u,v)£E}, 

K' = n 2 -K. 

In a sense G' is the complement of G, with the extra vertices. 

Claim : G has a max cut of at least size K if and only if G' has a bisection of size 
less than or equal to K' . 

Suppose (A,B) is a partition of G such that \{(u,v) £ E \ u £ A,v £ B}\ > K. 
Since K > 0, \A\, \B\ > 0. Let j = n — \A\. Let 

A' = A[j{ui,...,Uj}, 
B! = V' — A'. 

Then (A'jB*) is a bisection of (?'. Also, it is easily seen that 

\(A',B')\ = n 2 -\{(u,v)£E'\ueA',veB'}\ 
= n 2 -\{(u,v)eE\u€A,veB}\ 
<n 2 -K 
= K'. 

Conversely, if (A', 0) is a bisection of G', such that \{A',&^ <n 2 — K = K'. then 
let A = A' D V and B — & fl V, clearly (A, B) is a partition of G satisfying 

\(A,B)\=\{(u,v)(EE\ueA,veB}\ 

= n 2 -\{{u,v)gE J \ueA',v£B , }\ 

>n 2 -{n*-K) 

= K. 

This proves the claim and the proposition. | 
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2.3. Known Lower Bounds on the Bisection Size of Graphs 

As indicated in the previous section the graph bisection problem is NP- 
complete, this means that at the present an algorithm to determine the bisection 
size of a graph takes time exponential in the size of the graph (e.g. the number of 
the vertices). It is, therefore, useful to know a lower bound of the bisection size of 
a given graph so that one can determine how good is an approximate algorithm 
for finding the bisection size. The first few results presented below give the lower 
bounds for bisection size based on the knowledge of the adjacency matrix of the 
graph. Let G = (V, E) be graph with |V| = 2n. Let A(G) be the adjacency matrix 
of G and let U be an arbitrary diagonal matrix of order 2n such that 

2n 

£ ua = -2 \E\. (2-9) 

t=i 

We denote by Xi, X2 the two largest eigenvalues of the real symmetric matrix 
{A + U). 

Proposition 2.2. [DH73] Let G = [V,E) be a graph with \V\ = 2n and let 
A,U,\\,\2 be defined as above, then 

C(G) > - £(Xi + X 2 ). (2-10) 

If we know the maximum degree of the vertices in G then we have the following 
stronger result than Proposition 2.2. 

Proposition 2.3. [DH73] Let G = {V,E) be a graph with \V\ = 2n and let 
A,U,\i,\2 be defined as above, and for all v £ V, deg(v) < d for some 
fixed constant d. Let 61,62 be in the closed interval [0, tt/4] ; and let x be the 
simultaneous solution to the following equations ■ 

x sin 2*i = (1 — x) sin 2*2 (2-11) 

— -(<$i + 6 2 ) = i[l — sin 2*i + 2(d — 1)(1 — cos(o"i + 62))] (2-12) 



then 



C(G) > xn. (2-13) 
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Given n and m, if we define C Him to be the maximal bisection size of all graphs on 
n vertices and m edges, then we have the following result given by Goldberg and 
Gardner [GG82]. 

Proposition 2.4.[GG82] Let n and m be given. Let s be the largest integer such 
that m > s(2n — 1) — Js(s — 1), and let r = m — 2(2n — 1) + |s(s — 1). 

C 2 «,m > [j(2n - 0] + max{ 0, r - [n - || - 1 } (2-14) 

where [a] denotes the integer nearest to a. 

Proof : We shall construct a graph G = (V,E) with |Vj = 2n, \E\ = m and such 
that C(G) equals the right hand side of (2-14). We define G as follows. 

V = {ii,...,x 2 n}, 

E = { (x it Xj) | i = 1, . . ., s; j = 1, . . ., n; if^j} 
|J {(X.+1.X,-) | j = a + 2, . . ., a + r + 1 }. 

Thus G has a clique of size s and each vertex of that clique is also connected 
to all other vertices in V. There is also one vertex outside of the clique, namely 
x s _f_i, which is connected to r other vertices not inside the clique. Let (A, B) be 
an optimal bisection of G. Assume that a is the number of vertices in the clique 
that appear in A and hence there are s — a vertices of the clique appearing in B. 
Without loss of generality assume that a < £ then a < s — a. It is clear that 

C(G) = \(A,B)\ <{s — a)n + a(n — (s — a)) -j- max{ 0, r — (n — a) — 1 } 

hence for the optimal bisection we must have a = &. That is 

C(G) = [a(n-£)]+max{0,r-[n-^j-l} 

completing the proof of the proposition. | 

For random graphs in Q(2n, m) the following lower bound is given in [Mac78]. 

Proposition 2.5.[Mac78] Let s > 9 be fixed. Let G2 n ,m be a random graph in 
§(2n,m), with m = 2sn. Then 

Pr|c(G 2n , m ) < U - ^ )m\ - (2-15) 

as n-+oo. 
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2.4. Known Upper Bounds on the Bisection Size of Graphs 

By counting argument we can easily get the following upper bound on the 
bisection size of graphs in general. 

Proposition 2.6.[GG82] Let G = (V,E) be a graph with \V\ = 2n,\E\ = m. 
Then 

C(G) < 5^. (2-16) 

Proof : Since the number of different bisections is ^P"), we have 

^( 2n W)< £ \(A,B)\ (2-17) 

2 \ n J (A,B) 

where the sum is over all possible bisections of G. It is easily shown that each edge 
in G appears ^—1 ) times in the sum on the right hand side of (2-17). Hence 
the above inequality becomes 



t¥ G ^:-iy- 



Thus 

C(G) < 2 ^ 



m mn 



CO 2n - 1 ' 

The above result holds for any graph. 

In the case of random graphs in §[2n, m) we have the following upper bound 
given in [Mac78]. 

Proposition 2.7. Choose s > 0. Let G2n,m be a random graph in Q{2n,m) 
where m = 2sn. Then with probability 1, C((?2n,m) is less than or equal to 
m(X — H(s)), as n~*oo, where H(s) ph 0.2385 - */ 2 as s-+oo. This approximation 
is also good for s as small as 1. 

This upper bound is established by constructive method. An algorithm is 
exhibited and its performance is used to derive the upper bound. In practice, this 
algorithm is found to be inferior to most other well known algorithms for finding 
bisection size. The problem is that these superior algorithms are too complex to 
enable an analytical analysis be made on their performances. Therefore, one would 
expect that even if the random variable denoting the bisection size of random 



-19- 



graphs in £(2n, m) does not converge, the actual upper bound will be lower than 
that given in Proposition 2.7. It should be noted that the upper bound given in 
Proposition 2.7 is only for random graphs in £(2n,m) with a linear number of 
edges. 

2.5. Bounds for Graphs with Special Structures 

In general if the graph possesses some special structure we can produce a 
tighter bounds on its bisection size. The following results are given in [Mac78] we 
present them here without proof. 

Proposition 2.8. Let G = {V,E) be a graph d = max{ deg{v) \ v G V } > 3, with 
\E\ = m, then 

C(G) < ~±m + 0{d\ (2-18) 

Proposition 2.9. Let G = {V, E) be a graph with \V\ == 2n, \E\ = m, if m < n 
then C(G) = 0. 

Proposition 2.10. Let s £ (1/2, 3/2) be given. Then there exists an e > such 
that for any uq there is n > n and a graph G = (V, E) with \V\ = n, \E\ < sn 
such that max{ deg{v) \ v £ V } < 3 and C(G) > en. 

Proposition 2.11. Given a tree G with n vertices and let d be the maximum 
degree of all the vertices. Then 

{l + 2\ogon, if d = 3 or 4; 

1 * 2-19 

! + §(d + l)lo g<i n, ifd>5. 

Proposition 2.12. Let G be a complete ternary tree having n vertices with 
n = i(3* — 1), for some I even. Then 

C{G) > flogg n] - riog 3 log 3 n]. (2-20) 



The above proposition also holds for odd /, but then the number of vertices in 
the tree is odd and it does not have a bisection as defined in the first section of this 
chapter. 
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In this chapter we have reviewed the known results concerning the bisection 
size of graphs. Except for the results given by MacGregor [Mac78] for random 
graphs in §(2n,m) all the other results are given only to first order term. In the 
next chapter we shall give bounds for random graphs to second order term. We 
also expand our model of random graphs to deal with random graphs having a 
nonlinear number of edges, in contrast to MacGregor's results. 
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Chapter 3 
Bounds for Bisection Size of Graphs in Q (2n,p). 



In this chapter we consider random graphs in £(2n,p). We defined Q{2n,p) 
in Chapter 2 as a probability space consists of all graphs on In vertices, and the 
probability of a graph on In vertices and m edges is p m qS 2 / m , where q — 1 — p. 
We show in this chapter that with probability 1 the bisection size of Gin,p € 5(2n,p) 
is less than n?p—0(n) and is greater than n 2 p—0(n 3 / 2 ). We then extend our model 
so that p can be a function of n (from F"J to (0, 1)). In particular we consider the 
function p = p(n) = fa. Under this model, we show that a.e. graph in $(2n,p(n)) 
has bisection size greater than en — n^/2c\og2 and less than en — 0.476c 1 ' 2 n. 

3.1. Preliminary Results 

In this section we first present without proofs some elementary results in 
probability theory which will be needed later in the chapter. Proofs of these results 
can be found in most books on probability theory, e.g. [F74]. We next present some 
fundamental results about random graphs. 

Chebyshev's Inequality Let X be a random variable, for any t > 0, we have 

Pr{|X|>*}<^P (3-1) 

In particular, if E(X) — fx then 

Pr{\X-n\>t}<^l (3-2) 

where E(X) and Var(X) are the expectation and variance of X, respectively. 
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In our proofs we need to approximate expressions which contain the binomial 
random variable. For this purpose we use the following approximation given in 
[JK69] to approximate the binomial random variable by the normal random variable. 

Pr{ S„ < k } es M( k+ j_Z nP \ (3-3) 

where 

Pr{5 n <fc}=£("y <?*-*■ (3-4) 

and 

M(x) = — f t-^dy (3-5) 

is the normal distribution function. The following approximation of the normal 
random variable given in [F74] will also be needed later. 

JV(-x) = 1 - U(x) « x~M*) (3-6) 

as x— »oo, where 

n{x) = 4= c ~* x2 ( 3 " 7 ) 

is the normal density function. In our computation we also use the following form 
of Stirling's formula for factorial 

n! =n B e -n \/27rn(l-|-o(l)) (3-8) 

from which we can easily show that 



CD 



>2n 

(3-9) 



i/^rn 



By a simple application of the Chebyshev's inequality we get the first part of 
the following result. 

Proposition 3.1. Given e > and p £ (0, 1) fixed. A.e. graph in 5{n,p) has at 
least Up — e)n 2 edges and at most Up-\-e)n 2 edges. Ifp is a function /rom INf 
to (0,1) such that n 2 p(n)-HX> and (1 — p(n))n 2 -*oo then we again have : a.e. 
graph in §[n,p(n)} has at least Up{n) — e)n 2 edges and at most %{p{n) -+- e)n 2 
edges, for any e > 0. 
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The proof of the second part of the above proposition is given in the proof of 
Proposition 3.2. 

As mentioned in the previous chapter, the models Q{n,p) and §(n,m) are 
closely related. We show now under what conditions, properties of graphs in one 
model can be translated to properties of graphs in the other model. We first 
introduce the concept of convex property. Let T* be a collection of sets, T* is 
convex if A\ C A C Ai for some A\, A?, £ T* implies A £ T*. We can then define 
a convex property of graph similarly. 

Proposition 3.2.[Bo79] Let p = p(n) be a function fromti to (0,1) such that 
n 2 p(n)— ►oo and (1 — p(n))n 2 — kx> as n—HX>, let Q be a graph property, and let 

"-GO- 

(i) Lett > be fixed, and suppose that if (I— e)Np(n) < m < (1-f- e)Np(n) 
then a.e. graph in §(n,m) has Q. Then a.e. graph in Q{n,p{n)) has Q. 

(ii) If Q is a convex property and a.e. graph in Q[n,p{n)) has Q, then a.e. 
graph in Q[n, \Np[n)\) has Q. 

Proof : Let T denote the set of graphs in Q{n,p) and let T m denote the set of 
graphs in Q{n,m). Then T = ULo r ™> ^ 

Pr(r m ) = (y)p m <l N - m (3-10) 

therefore, 

PrFm) = m + l g , , 

Pr(r m+1 ) N-mp K } 

and Pr(r m ) is maximal for some m £ [Np — 1, Np + 1]. Let e £ (0, 1) be given, 

since n 2 p— ►oo as n-^oo, for sufficiently large n we have 

Pr(r m ) f 1 - e if m < (1 - e)Np; 

Pr(r TO+1 ) \(i + e )-i if m >(1 -f e)Np. 
Hence 

/ (1-eJATp \ 

Pr U T m -+ 1 asn-+oo. (3-13) 

Vm=(l+e)Np ) 

This proves (i) and also the second part of Proposition 3.1. To prove (ii) we note 
that from (3-11) we get 

Prl U Tm I > V (3-14) 



{IL r -) 
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for any n £ (0, 1/2) and for sufficiently large n. Prom (3-13) and (3-14) we have : 
if T* C T is such that Pr(r*)->- 1 as n— >-oo then for any e > 0, there exist mi and 
7712 such that (1 — e)Np < m\ < iVp < 7712 < (1 -f- e)iVp and 

|r "^ n p - 1 asn-*oo, i = l,2. (3-15) 

|r mi | 

In particular if T* is a convex set then we have 

ElZLQp _+ 1 „„_>«,, (3-I6) 

if mi < m < rri2 , say m = [ATpJ. This proves (ii) and completes the proof of the 
proposition. | 

3.2. Lower Bounds on the Bisection Size 

We first consider model §(2n, p) with p being a fixed constant in the interval 
(0, 1), independent of n. For each G2n,p E i?(2n,p) there are %\*£ J distinct bisections. 
We label the bisections 1, . . ., ^PnJ- ket fc be an integer in [0, n 2 ]. For each i £ 
{ 1, . . ., ^Pn) } define the following random variable on Q(2n,p) 

tu\ (1, if the ith bisection has size < k: 

A^(G 2n>p )= ' ~ (3-17) 

(0, otherwise. 

We next define Bjt(G2n,p) as a random variable on §(2n,p) denoting the number 
of bisections of G2n,p having size less than or equal to fc. That is 

( 2 ;)/2 



"k^2n,p) — / , ** 

It is clear that 



Bk(G 2 n, P ) = £ AnGta,). (3-18) 

1=1 



E(AS fc) ) = £ ^y* n2 - j - (3-19) 

Now we are ready to give the lower bound for bisection size of graphs in 
S(2n,p). 

Proposition 3.3. Let f(n) be a function such that f(n) = o(l) and f(n) = fi(£). 
Then a.e. graph in Q{2n,p), where p is a fixed constant, has bisection size 
greater than or equal to 

fc(n) = n 2 p — nyjAnpq log 2 — 2pg log n — 2pq log /(n) -f- 0(1). (3-20) 
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Proof : Fix n, let k = k(n) be given as in the statement of the proposition. Let 
G2n,p G 5(2n, p) be given. We have 

Pr(C(G2n,p) < fc) = Pr(some bisections of G2n,p has size < k) 
= Pr(B fc ^ 0) 
< E(B fc ). 

Now let M = £( 2 n n ), then 

E(B*) = £ E(Ai fc) ) 

-Is(;>-' 

1 2 2n [ 1 \/ n2 P9 / 1 / fc^ 
pa 1 5 r ex Pl ( n P 

«/(n) + o(l) 

= 0(1) 

by a straightforward application of (3-3) and Stirling's formula, and by our choice 
of k. Thus 

Urn Pr(C(G 2n , p ) > k) = 1 

TV-+0O 

this completes the proof of the proposition, fl 

In Proposition 3.3 p is a constant, and hence by Proposition 3.1 the number 
of edges of random graphs in Q{2n,p), with probability 1, lies in the interval 
[(p — e)2n 2 , (p -f- e)2n 2 ] for any e > 0. If we now instead consider p as a function of 
n, in particular p = fe then again by Proposition 3.1 the number of edges of a.e. 
graph in ,p(2n,p(n) = &) lies in the interval [(2c — e)n, (2c -f- e)n] for any e > 0. In 
other words, graphs in £(2n,p(n) = ^), with probability 1, have a linear number 
of edges. By using Propositions 3.1, 3.2, and 2.5 we get the following. 

Proposition 3.4. A.e. graph in Q{2n,p), where p = p[n) = £ for some constant 
c > 9 ; has bisection size greater than or equal to 

k{n) = cn — n^2clog2 (1 + o(l)) (3-21) 
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For the general case of p a function of n we conjecture that the following holds. 

Conjecture. Let p(n) be a function from N to (0, 1) such that n 2 p(n)— ►oo as 
n— ►oo. A.e. graph in Q[2n,p[n)) has bisection size greater than or equal to 

k(n) = n 2 p(n) — 2n 3 / 2 v /p(n)< ? (n) log 2 (1 + o(l)) (3-22) 



3.3. Upper Bounds on the Bisection Size 

As in the previous section, we first consider the model Q{2n,p) with p being 
a fixed constant in (0, 1). Define the random variables A{- ' and B* as in equations 
(3-18) and (3-19). We first need the following lemma which was given by Matula 
[Mat76] as a stronger version of the so called second moment method used frequently 
in [ES74]. 

Lemma 3.5. Let X be a nonnegative, integer valued random variable with 
mean E(X), standard deviation a < oo, then 

Pr{X^0}>|gj (3-23) 



Proof : As 2ij < i 2 + j 2 , we have 

E 2 (X) = (£tPr(X = i)) 

00 oo 

= E E y*M* = Wx = j) 

= (Ey 2 Pr(X = j))f;Pr(X = t) 

== E(X 2 )Pr(X ^ 0) 

Thus the lemma is proved. | 

Proposition 3.6. Let p £ (0,1) be a fixed constant. A.e. graph ,£(2n,p) has 
bisection size less than 

k(n) — n 2 p — an (3-24) 
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for some a < ^hzsjpq- 

Proof : Fix n, let k = k(n) be given as above. It is clear from the definition of B* 
that 

It f 0\ 

„2 



E(B fc ) = M E f n . 2 W 
where M = MnJ- The second moment of B* is 



—J 



(3-25) 



E(B|) = E 



— E 






+ E 



M M 



< E(Bjt) + -E 



£ £ (a?>) W 

t=l ,==i v . / v /■ 



M M 






<E(Bjt) + (M-l)E(B fc ) 
= ME(Bifc) 

.2 / ,,>\2 



(3-26) 



as 



(a^) = A\ k) and 2A|* ) aJ* ) < (aJ* } ) + f A^] . By Lemma 3.5 we have 

E(B*) 2 



Pr(B fc ^ 0) > 
> 



EM) 
E(B fc ) 2 
ME(B fc ) 



1 \Jn 2 pq ( \t _fc\ 2> \ 

E"n 2 p-A: eXP V 2pgV nP rJ J 



for our choice of A;, the right hand side of the above expression goes to 1 as n-*oo. 
This proves the proposition. | 

Consider next random graphs in Q(2n,p{n) = ^) for some c > 1. We can use 
Proposition 3.2 to translate the upper bound given by Proposition 2.7 for graphs 
in Q{2n,m = 2cn), to an upper bound for graphs in £(2n,p(n) = ^). 
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Proposition 3.7. Letp(n) = fc for some constant c > 1. A.e. graph in Q[2n,p{n)) 
has bisection size less than 

k(n) = cn — 2H(c)cn. (3-27) 

where H{c) ^0.23Sc~ 1 / 2 . 

We conjecture that the following holds for a.e. graph in ${n,p(n)) 

Conjecture. Let p{n) be a function from IS to (0, 1) such that n 2 p(n)-*oo as 
n-+oo. A.e. graph in §(2n,p(n)) has bisection size less than 



fc(n) = n 2 p(n) — an\Jp(n)q(n) (3-28) 

From Proposition 3.1 we see that, with probability 1, graphs in Q{2n,p) have 
2n 2 p(l -J- o(l)) edges. Our results in this chapter show that about half of these 
edges will appear as cut edges in the optimal bisections. This conclusion can be 
drawn from Chapter 2, what is new here is the presence of the second order terms 
in the bounds. In fact we need these second order terms if we wish to use these 
bounds to judge the performance of a heuristic on random graphs. This is the case 
as, given a random graph, we can easily show that with high probability, a random 
bisection will have size equal to about half of the number of edges in the random 
graph. Thus any improvement by a heuristic over a random bisection will reflect 
only in the terms of order o(n 2 ). 

In the next chapter we shall present some known approximate algorithms for 
solving the graph bisection problem, and some empirical data of their performances 
on random graphs. 
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Chapter 4 
Graph Bisection Heuristics 



In this chapter we describe various heuristics that are used for solving the 
graph bisection problem. All of these heuristics have one basic working principle. 
They all start with a randomly chosen bisection, and improve upon that bisection. 
The differences among these heuristics are in how the improvement is done. The 
simplest of these is an algorithm which chooses the vertices to be exchanged by 
ordering the vertices in each part of the bisection based on the property of the 
initial bisection, and then choosing the best initial segments from each part of the 
bisection to be exchanged. The next approximation algorithm that we describe 
is given by Kemighan and Lin in [KL70], this algorithm also orders the vertices 
in each part of the original bisection and chooses the best initial segments to be 
exchanged. However, the ordering is done dynamically. That is, the order of one 
vertex depends both on the original bisection and the vertices appearing before it 
in the ordering, whereas the order of one vertex in the previous alogrithm depends 
only on the original bisection. An interesting algorithm proposed by Kirkpatrick, 
Gelatt, and Vecchi [KGV82] uses ideas in statistical mechanics to help solve the 
graph bisection problem. In the last section of this chapter we shall present some 
empirical results. 

4.1. Descriptions of Some Heuristics 

There is one obvious way to solve the graph bisection problem, namely, 
enumerating all the possible bisections and choose one that has a minimum size. 
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However, to enumerate all the bisections of a graph on 2n vertices, it takes time 
proportional to ( 2 ™j which is exponential in n, hence this method is very inefficient 
for large n. 

Another method is to generate the bisections randomly, and keep the one that 
has the smallest size among the ones that have been produced so far. This is faster 
than the exhaustive procedure described above, nonetheless, by experience [KL] 
reported that the number of optimal bisections or near optimal bisection is very 
small, hence the probability of getting an optimal solution by this method is quite 
small. In fact, [KL] has experimented with a class of 0-1 matrices of size 32 X 32 
(these can be considered as the adjacency matrices) and found that the number of 
optimal bisections are typically 3 or 5 out of a total of f 16 J bisections, which means 
that the probability of success on any trial is less than 10~ 7 . 

By considering the given graph as a network and the cost of the edges in the 
given graph as the maximum flow capacity of the edges, then it seems that the 
graph bisection problem can be solved by using the well known Ford-Fulkerson 
max-flow, min-cut algorithm. This algorithm will give not only the maximum flow 
between any two points but also the minimum cut that separates those two points. 
Unfortunately, the algorithm does not have control over the size of the subsets 
of the partition, that is it will give the min-cut but the resulting two subsets of 
the partition need not have equal size as required by the graph bisection problem. 
Furthermore, there is no obvious way to modify the algorithm so that we have 
control over the size of the subsets of the bisection. 

We now describe a simple heuristic called the block heuristic. A similar heuristic 
is used in the proof of Proposition 2.7 in fMac78}. Let G — (V,E) be a graph 
with |V| = 2n, the algorithm starts with a random bisection, say (A,B) of G. We 
define the gain of each vertex in the graph with respect to that bisection as follows. 
Let a £ A, the gain of A, denoted by g a , is the difference in the number of edges 
connecting a to vertices in B, and the number of edges connecting o to vertices 
in A. We extend this definition to pair of vertices one in A and one in B. More 
formally, define * 

g a , b = \(A,B)\ -\{A\Bf)\ (4-1) 
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where 

A' = (A-{a})(jW and B' = (B-{b})\J{a}. (4-2) 

In other words, g a> b is the reduction in the size of the bisection when a and b are 
interchanged. Clearly, 

9a,b = 9a + 9b-2t(a,b) (4-3) 

where 

*(-.») - J 1 ' ifM)6 * ; (4-4) 

lo, il{a,b)(E. 

This algorithm first calculates g v for all v E V. It then chooses a\ £ A, &i £ fl, such 
that <7oi,6i is maximum. It then computes the size of the new bisection resulted 
when ai and b\ are interchanged, and sets oi, &i aside. Next it chooses the best pair 
02, 62 in A — {ai} and B — {61} such that <7o 2) 6 2 * s maximum, then sets 02, 62 aside. 
The process is repeated until there is only one pair of vertices left. The algorithm 
then returns the bisection with the smallest size found. Note that when choosing 
02 and 62 the gains of a-i and 62 have not been updated to account for the fact that 
ai and b\ have been interchanged. That is, at all stages we use the gains computed 
at the beginning of the algorithm. The algorithm can be described more formally 
as follows. 

begin 

1. Compute g a , gb for each a(zA,b£B. 

2. LetQ A = 0,Q B = 0. 

3. Tor i = 2 to n do 

begin 

4. Choose a,i£A — Qa and fy £ B — Qb such that g^^ 
is maximum over all choices of a and 6. 

5. Set Q A = Qa U fa}, Qb = Qb U fa} 
end 

6. Choose k £ {1, . . ., n} to maximize E*_i ffo^- 

7. Interchange the subsets {aj, . . .,0^} and {61, . . ., &&} to get the new bisection. 

end 

Figure 4.1. The Block Algorithm 
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Kernighan and Lin in [KL70] give a heuristic for solving the graph bisection 
problem which seems to work well in practice. Let a graph G = (V, E), V = 2n be 
given. The main idea here is the same as before, that is to start with an arbitrary 
bisection, say (A,B), and improve upon it. The improvement is accomplished by 
interchanging subsets X C A,Y C B, and |X| = \Y\ < n so that the size of the 
bisection is decreased. If we consider all possible subsets of A and B, we shall be 
able to pick out the subsets, whose interchange will give us the optimal bisection. 
However, this will be an exponentially long procedure, which is undesirable. The 
Kernighan-Lin heuristic finds these subsets approximately by choosing elements of 
X and Y sequentially. This choosing process is done as follows. For each element 
o G A, b G B, let g a> b be defined as before. The algorithm first computes g a j, for all 
a G A, b G B. It then chooses a\ G A, b\ G B such that 

g aubl = max{ g a ,b \ a G A, 6 G B } 

The algorithm now updates the gains of all vertices in V, except a\ and 
61, with respect to the new bisection ((A — {a\}) U {&i} , {B — {61}) U {^l})- The 
algorithm next repeats the process for this new bisection and chooses a new pair 
of vertices to be exchanged, except that a\ and 61 will not be considered anymore 
in choosing the next pair that will give the maximum reduction. That is, once a 
vertex is chosen to be exchanged it will no longer be considered in later stages. 
The process is repeated untill all vertices have been considered. We now have a 
list of pairs [a\ t b\), . . ., (a n , 6 n ). Clearly if all these pairs are interchanged the total 
reduction is zero. The algorithm now chooses a k < n such that the interchange 
of the subsets {ai,...,ajt} and {6i,...,6jfc} will give a maximum reduction over 
all choices of k < n. This whole process is called a pass of the algorithm. The 
algorithm can have several passes. Each pass, except the first one, starts with the 
bisection given as the result of the previous pass. The algorithm can have a fixed 
number of passes or it can run until no more improvement is possible. Another 
alternative is to have an entire new arbitrary bisection as the input to each pass, 
and keep the smallest bisection produced so far. 

We now describe the algorithm formally. Let G = {V,E) be a graph with 
V = 2n. Let (A,B) be a bisection of G. For each a G A b G B define g a ,b as 
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before. The heuristic is shown in Figure 4.2. Steps 7 and 9 of the algorithm can be 
easily checked that the values of the g a and g\, are correctly updated with respect 
to the new bisection, that is after the sets {ai, ...,ai} and {b\,...,bi) have been 
interchanged. It is also easily shown that the running time of the algorithm is 
0(n 2 logn). 

begin 

1. Compute g a , gb for each a £ A, b £ B. 

2. <3 A = 0,Q fl =0. 

3. for i = 2 to n do 

begin 

4. Choose ai £ A — Qa and bi £ B — Qb such that g^^ 
is maximum over all choices of a and b. 

5. Set Q A = Qa U M, Q b = Qb U {&<} 

6. for each a £ A — Qx do 

7. Ja = Ja + 26(a, a) - 25(a, 6j) 

8. for each b £ B — Q B do 

9. < ?6 = <fc + 2%&i)-2%a i ) 
end 

10. Choose A; £ {1, . . ., n} to maximize Hi = \ Onfa- 
ll. Interchange the subsets {ai, . . ., a*} and {bi, . . ., &*} to get the new bisection. 
end 

Figure 4.2. One pass of the Kernighan-Lin bisection heuristic 



Other graph bisection heuristics which are variations of the Kernighan-Lin's 
heuristic have been considered by Macgregor [Mac78], he also considered hybrid of 
these heuristics. It should be noted that a slight variation of the Kernighan-Lin 
heuristic has been implemented by Fiduccia and Mattheyses [FM82] to run in linear 
time by using some clever data structures. 

Finally, we present an algorithm proposed by Kirkpatrick, et al., [KGV82], 
which makes an interesting connection between the annealing process and the 
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iterative improvement process of the graph bisection heuristics. Consider a system 
consisting of a large number of atoms, such as a sample of liquid or solid 
matter. The aggregate behaviour of the system can be observed by considering 
the average behaviour taken over an ensemble of identical systems. We associate 
with each configuration of the system in the ensemble a Boltzmann's probability, 
exp(— E({ri})/ksT), where E({ri}) is the energy of the configuration defined by 
the atomic positions {r^}, fcjg is the Boltzman's constant, and T is the temperature. 
One wishes to know what happens to the system in the limit of low temperature, 
for instance, whether atoms remain fluid or solidify. It is known that ground states 
and configurations having energy close to them are very rare, nonetheless, they 
dominate the behaviour of the system at low temperature because as T is lowered 
the Boltzmann distribution collapsed into the lowest energy state or states. To find 
the low temperature states of a system it is necessary to use an annealing process. 
That is to first melt the substance, then lower the temperature slowly, and spend 
a long time at the temperatures near the freezing point. Otherwise, the resulting 
configuration will be metastable. 

There is a simple algorithm given by Metropolis, et al. [M53] which simulates 
a collection of atoms in equilibrium at a given temperature. In each step of the 
algorithm, an atom is given a small random displacement, and the corresponding 
change in energy, AE, of the system is computed. If AE < 0, then the displacment 
is accepted, and the configuration with the just displaced atom is used as the starting 
configuration for the next step. When AE > 0, the configuration is accepted with 
probability Pt(AE) = exp(— AE/1cbT). A random number is chosen uniformly in 
the interval (0, 1), and compared with Pr(A.E). If it is less than Pt(AE) then the 
new configuration is accepted, if not, the original configuration is retained and we 
repeat the process. 

It is observed in [KGV82] that the iterative improvement process in a 
combinatorial optimization problem such as the graph bisection problem is similar 
to the microscopic rearrangement processes modelled by statistical mechanics, where 
an appropriate cost function for the graph bisection problem will play the role of 
energy. Using this analogy we note that in the process of finding the solution if we 
only accept rearrangements that reduce the cost function, then this is like rapid 
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quenching from high temperature to T = 0, thus the resulting solutions will often 
be local optima and metastable. By utilizing the Metropolis' algorithm described 
above, in which rearrangements that increase the cost function are sometimes 
accepted, we can expect to get better solutions as indicated by the observation 
made in actual physical processes. This algorithm has not been extensively tested 
but the results found so far seem very favourable. 

4.2. Some Empirical Results 

In this section we give the results produced by the block and Kernighan-Lin 
algorithms when random graphs in §(n, p = 1/2) are used as inputs. We ran both 
algorithms on graphs in S{n,p = 1/2) as n varies from 20 to 800 in increments of 
20 and from 800 to 1000 in increments of 50. For each value of n 10 random graphs 
are generated, the results produced by each algorithm on these random graphs are 
averaged and used as the result of that algorithm for that value of n. 

To speed up the running time some modifications are made in both algorithms 
in implementing them. In both algorithms the most time consuming step is in 
choosing the best pair to be interchanged. Assume that the current bisection is 
{A, B), instead of considering each pair (a, b), a € A, b £ B to find one with the 
largest g a fi (defined in the previous section), we simply find the pair (a, b) such 
that g a -f- gb is largest, i.e., we can choose a and b independent of each other. In 
the block algorithm we further reduced the running time by considering blocks to 
be exchanged only of length less than n/2 for graphs with 2n vertices. That is, 
in step 3 of the block algorithm, the for loop runs from 2 to [n/2] instead of n. 
This is due to the observation in some experimentation that the length of the block 
to be exchanged in the block algorithm is almost always less than one fourth the 
number of vertices in the graph. The change in the performance as a result of these 
modifications is not significant, as observed and also reported in [Mac78]. 

For large value of n the Kernighan-Lin algorithm seems to perform quite better 
than the block algorithm but the running time is also longer. The bisection sizes 
produced by the Kernighan-Lin algorithm apparently lie well in halfway between 
the upper bounds and the lower bounds. The programs are written in MACLISP, 
and run on a DEC20. On an input of 1000 vertices the Kernighan-Lin algorithm 
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takes approximately 283 seconds and the block algorithm takes approximately 240 
seconds. 




NUCER OF VERTICES 



Figure 4.3. Graph oT Vertex Size vs. Bisection Sizes 
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In Figure 4.3 the top curve is the upper bound given in Chapter 3, the curve 
second from the top is the bisection size produced by the block algorithm, the next 
curve represents the bisection sizes returned by the Kernighan-Lin algorithm, and 
the bottom curve is the lower bound given in Chapter 3. The graph was prepared 
using the MACSYMA program developed by the Mathlab group at MIT 2 [MAC77]. 



2 Work of the Mathlab group is currently supported in part, by the United States Energy 
Research and Development Administration under Contract Number E(ll-l)-3070 and by the 
National Aeronautics and Space Administration under Grant NSG 1323. 



-39- 



This empty page was substituted for a 
blank page in the original document. 



Chapter 5 
Conclusion 



In this thesis we have shown that almost every graph in £(2n,p) has bisection 
size greater than n 2 p — Ol n 3 / 2 ^/pq j and less than n 2 p — Oy n^/pq ]. We also show 
that a.e. graph in j?(2n,p(n) = £) with c > 9 has bisection size greater than 
en — nyJ2c log 2 and less than en — QAl^l^n. These, together with other known 
results on bisection size, seem to indicate that, except for specially structured 
graphs, we always have to remove about half the number of edges of a graph 
to bisect it. Empirical results also give evidence that there is no heuristic that 
performs well on all kind of graphs [Mac78]. This suggests that if one wishes to 
obtain optimal or near optimal bisection of a graph, one needs to know more than 
just the number of vertices and edges. Knowing about the structure of the graph 
alone is not sufficient, however, one also needs special algorithms to exploit that 
structure. 

Efforts in proving how good is the performance of a graph bisection heuristic 
on random graphs have been in general unsuccessful. Even though most graph 
bisection heuristics are deterministic, except that choosing the starting bisection is 
done randomly, they are still very difficult to analyze. The difficulty lies in the fact 
that the updating process and the interchanging of vertices change the distribution 
of the edges in a random graph. It is useful to have provably good graph bisection 
heuristics, for besides from being of interest in itself, it also has been shown by 
Leighton [L82] that by using a provably good graph bisection heuristic, we can 
get provably good algorithm for the crossing number problem, and from which we 



-41- 



can get provably good -algorithm for the graph layout problem. Given a graph, the 
assingment of the nodes and edges of the graph to the points and tracks (vertical 
and horizontal) of a rectangular grid is called an embedding of the graph. The 
graph layout problem is the problem of finding an embedding of the graph such that 
the following quantities of the grid are minimum : area, number of edge crossings, 
number of total edge length, and the maximum edge length. The crossing number 
of the graph is the minimum number of edge crossings over all possible embeddings 
of the graph. The crossing number problem is then, of course, the problem of 
determining the crossing number of a given graph. Both the layout problem and 
the crossing number problem are known to be NP-complete. The layout problem is 
one of the vital parts in the VLSI design process, especially of chips which can do 
a large amount of computation reliably and efficiently. 

Besides from the conjectures in Chapter 3, our main open question is to 
determine analytically how well a certain graph bisection heuristic does. In 
Proposition 3.4 we show the lower bound on the bisection size for random graphs 
having a linear number of edges, in particular this result only holds for the number 
of edges being greater than nine times the number of vertices. Recent experimental 
results by Goldberg and Gardner [GG82a] indicate that random graphs with the 
number of edges equal the number of vertices have bisection size between 25% and 
30% of the number of edges. Thus it is of interest to know what is the lower bound 
for the bisection size of random graphs when the number of edges is less than nine 
times the number of vertices. 

To more closely model the graph partitioning problem arising in actual 
application a more complicated model is needed. One possible model is the set of 
random hypergraphs. It would then be useful to have results for random hypergraphs 
similar to the known results for the random graphs. Also in actual application we 
usually need to partition the graph into more than two parts, and at the moment 
there is no good uniform method for extending the known graph bisection heuristics 
to solve the general graph partitioning problem. These are a few possible directions 
for further study concerning this subject. 
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